Recovering a Hidden Community Beyond the Spectral Limit in O(|E|log*|V|) Time

Authors

  • Bruce E. Hajek
  • Yihong Wu
  • Jiaming Xu
Abstract

The stochastic block model for one community with parameters $n$, $K$, $p$, and $q$ is considered: $K$ out of $n$ vertices are in the community; two vertices are connected by an edge with probability $p$ if they are both in the community and with probability $q$ otherwise, where $p > q > 0$ and $p/q$ is assumed to be bounded. An estimator based on observation of the graph $G = (V, E)$ is said to achieve weak recovery if the mean number of misclassified vertices is $o(K)$ as $n \to \infty$. A critical role is played by the effective signal-to-noise ratio $\lambda = K^2 (p - q)^2 / ((n - K) q)$. In the regime $K = \Theta(n)$, a naïve degree-thresholding algorithm achieves weak recovery in $O(|E|)$ time if $\lambda \to \infty$, which coincides with the information theoretic possibility of weak recovery. The main focus of the paper is on weak recovery in the sublinear regime $K = o(n)$ and $np = n^{o(1)}$. It is shown that weak recovery is provided by a belief propagation algorithm running for $\log^*(n) + O(1)$ iterations, if $\lambda > 1/e$, with the total time complexity $O(|E| \log^* n)$. Conversely, no local algorithm with radius $t$ of interaction satisfying $t = o\left(\frac{\log n}{\log(2 + np)}\right)$ can asymptotically outperform trivial random guessing if $\lambda \le 1/e$. By analyzing a linear message-passing algorithm that corresponds to applying power iteration to the non-backtracking matrix of the graph, we provide evidence to suggest that spectral methods fail to provide weak recovery if $\lambda \le 1$.
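The model and the dense-regime baseline described in the abstract can be illustrated with a small sketch. It samples one graph from the single-community model, evaluates the effective signal-to-noise ratio, and classifies vertices by thresholding their degrees. The function names, the midpoint threshold between the two expected degrees, and the toy parameters are assumptions made for illustration and are not taken from the paper. The sampler loops over all vertex pairs, so it does not reflect the $O(|E|)$ running time quoted for the algorithm itself, and it does not implement belief propagation.

```python
import random


def effective_snr(n, K, p, q):
    """Effective SNR from the abstract: K^2 (p - q)^2 / ((n - K) q)."""
    return K ** 2 * (p - q) ** 2 / ((n - K) * q)


def sample_one_community_sbm(n, K, p, q, rng):
    """Draw one graph; return the hidden community and every vertex degree."""
    community = set(rng.sample(range(n), K))
    degrees = [0] * n
    for u in range(n):
        for v in range(u + 1, n):
            # Edge probability is p only when both endpoints are in the community.
            prob = p if (u in community and v in community) else q
            if rng.random() < prob:
                degrees[u] += 1
                degrees[v] += 1
    return community, degrees


def degree_threshold_estimate(degrees, n, K, p, q):
    """Label a vertex as a community member if its degree exceeds a threshold.

    Assumption: the threshold is the midpoint between the expected degree of
    an outsider, (n - 1) q, and that of a member, (K - 1) p + (n - K) q.
    """
    threshold = (n - 1) * q + (K - 1) * (p - q) / 2
    return {v for v, d in enumerate(degrees) if d > threshold}


def misclassified(estimate, community):
    """Number of vertices on which the estimate and the truth disagree."""
    return len(estimate ^ community)


if __name__ == "__main__":
    # Toy parameters in the dense regime K = Theta(n); chosen for illustration.
    n, K, p, q = 400, 200, 0.5, 0.1
    rng = random.Random(0)
    community, degrees = sample_one_community_sbm(n, K, p, q, rng)
    estimate = degree_threshold_estimate(degrees, n, K, p, q)
    print("lambda =", effective_snr(n, K, p, q))
    print("misclassified =", misclassified(estimate, community))
```

Weak recovery in the sense of the abstract asks for the misclassification count to be $o(K)$; in a single finite draw such as this one, the count can only be compared informally against $K$.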


Similar resources

Recovering a Hidden Community in a Preferential Attachment Graph

A message passing algorithm is derived for recovering a dense subgraph within a graph generated by a variation of the Barabási-Albert preferential attachment model. The estimator is assumed to know the arrival times, or order of attachment, of the vertices. The derivation of the algorithm is based on belief propagation under an independence assumption. Two precursors to the message pas...


Spectral Estimation of Stationary Time Series: Recent Developments

Spectral analysis considers the problem of determining (the art of recovering) the spectral content (i.e., the distribution of power over frequency) of a stationary time series from a finite set of measurements, by means of either nonparametric or parametric techniques. This paper introduces the spectral analysis problem, motivates the definition of power spectral density functions, and reviews...


Graph Sparsification by Effective Resistances (arXiv:0803.0929v4 [cs.DS], 18 Nov 2009)

We present a nearly-linear time algorithm that produces high-quality spectral sparsifiers of weighted graphs. Given as input a weighted graph $G = (V, E, w)$ and a parameter $\epsilon > 0$, we produce a weighted subgraph $H = (V, \tilde{E}, \tilde{w})$ of $G$ such that $|\tilde{E}| = O(n \log n / \epsilon^2)$ and for all vectors $x \in \mathbb{R}^V$, $(1 - \epsilon) \sum_{uv \in E} (x(u) - x(v))^2 w_{uv} \le \sum_{uv \in \tilde{E}} (x(u) - x(v))^2 \tilde{w}_{uv} \le (1 + \epsilon) \sum_{uv \in E} (x(u) - x(v))^2 w_{uv}$. (1) This improves...


Semidefinite Programs for Exact Recovery of a Hidden Community

We study a semidefinite programming (SDP) relaxation of the maximum likelihood estimation for exactly recovering a hidden community of cardinality $K$ from an $n \times n$ symmetric data matrix $A$, where for distinct indices $i, j$, $A_{ij} \sim P$ if $i, j$ are both in the community and $A_{ij} \sim Q$ otherwise, for two known probability distributions $P$ and $Q$. We identify a sufficient condition and a necessary condition fo...


Submatrix localization via message passing

The principal submatrix localization problem deals with recovering a $K \times K$ principal submatrix of elevated mean $\mu$ in a large $n \times n$ symmetric matrix subject to additive standard Gaussian noise. This problem serves as a prototypical example for community detection, in which the community corresponds to the support of the submatrix. The main result of this paper is that in the regime $\Omega(\sqrt{n}) \le K \le$ ...



Journal:
  • CoRR

Volume: abs/1510.02786

Publication date: 2015